Deep Digging into the Generalization of Self-Supervised Monocular Depth Estimation
نویسندگان
چکیده
Self-supervised monocular depth estimation has been widely studied recently. Most of the work focused on improving performance benchmark datasets, such as KITTI, but offered a few experiments generalization performance. In this paper, we investigate backbone networks (e.g., CNNs, Transformers, and CNN-Transformer hybrid models) toward estimation. We first evaluate state-of-the-art models diverse public which have never seen during network training. Next, effects texture-biased shape-biased representations using various texture-shifted datasets that generated. observe Transformers exhibit strong shape bias CNNs do texture-bias. also find show better for compared to models. Based these observations, newly design with multi-level adaptive feature fusion module, called MonoFormer. The intuition behind MonoFormer is increase by employing while compensating weak locality adaptively fusing representations. Extensive proposed method achieves datasets. Our shows best ability among competitive methods.
منابع مشابه
Self-Supervised Monocular Image Depth Learning and Confidence Estimation
Convolutional Neural Networks (CNNs) need large amounts of data with ground truth annotation, which is a challenging problem that has limited the development and fast deployment of CNNs for many computer vision tasks. We propose a novel framework for depth estimation from monocular images with corresponding confidence in a selfsupervised manner. A fully differential patch-based cost function is...
متن کاملA Compromise Principle in Deep Monocular Depth Estimation
Monocular depth estimation, which plays a key role in understanding 3D scene geometry, is fundamentally an illposed problem. Existing methods based on deep convolutional neural networks (DCNNs) have examined this problem by learning convolutional networks to estimate continuous depth maps from monocular images. However, we find that training a network to predict a high spatial resolution contin...
متن کاملQualitative Estimation of Depth in Monocular Vision
In this paper we propose two techniques to qualitatively estimate distance in monocular vision. Two kinds of approaches are described, the former based on texture analysis and the latter on histogram inspection. Although both the methods allow only to determine whether a point within an image is nearer or farther than another with respect to the observer, they can be usefully exploited in all t...
متن کاملDigging deep into “dirty” drugs – modulation of the methylation machinery
DNA methylation and histone modification are epigenetic mechanisms that result in altered gene expression and cellular phenotype. The exact role of methylation in myelodysplastic syndromes (MDS) and acute myeloid leukemia (AML) remains unclear. However, aberrations (e.g. loss-/gain-of-function or up-/down-regulation) in components of epigenetic transcriptional regulation in general, and of the ...
متن کاملFusion of stereo and still monocular depth estimates in a self-supervised learning context
We study how autonomous robots can learn by themselves to improve their depth estimation capability. In particular, we investigate a self-supervised learning setup in which stereo vision depth estimates serve as targets for a convolutional neural network (CNN) that transforms a single still image to a dense depth map. After training, the stereo and mono estimates are fused with a novel fusion m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i1.25090